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DUPLICATE 



CASCADE SYSTEM FOR NETWORK UNITS 
Field of the Invention 

This invention relates to packet-switched communication networks. More 
particularly the invention relates to networks compatible with 'Ethernet' 
transmission formats and protocols. The invention specifically concerns 

what is known as the 'cascading' of switches or other units in or for such 
a network. Various aspects of the invention concern the logical 
management, control and signal formats preferably incorporated in a 
cascade. 

Background to the Invention 

The physical elements in a packet-based data communication system 
commonly include multi-port units such as switches and routers. Such 
units are commonly and conveniently manufactured with a fixed plurality 
of ports by means of which the units can be connected by way of 
appropriate transmission links (such as cables or optical fibres) to other 
units or user terminals. In order to provide greater versatility in 
constructing networks while minimising the number of different sizes of 
units that need to be deployed it is known from EP-0912944 and also 
various s\A'itches, such as the Super Stack 3 Switch 3300 made by 3Com 
Corporation, to render the units 'stackable' by which is meant that a 
multiplicity of them can be interconnected to form effectively a' single 
controUabie entity. It is customary to make such units physically 
stackable in a column although this physical aspect of the term is not 
essential to the meaning of 'stackable' or to the present invention. 

In order to provide intercommunication between the units, so that for 
example packets received at any of the ordinary ports (commonly termed 



Tront panel' ports) can be forwarded from a port or ports on another unit 
or units, depending on the addressing of the packets and the nature of 
them, It IS customary to connect stacked units together by means of a 
cascade, which is the term used for the communication links and 
associated control functions by means of which packets are transmitted 
between the units in the stack. 

The design and organisation of a cascade are attended by some 
considerable complexity. For example, the forwarding of packets on the 
cascade has to be controlled according to a variety of rules which are 
intended ro conform, for each particular unit, to the forwarding rules 
relevant for different types of packet, such as unicast, broadcast and 
multicast, as well as other relevant rules such as those relating to 
bridging. The rules may need to be preserved or modified to make the 
operation of the cascade more convenient or more versatile. Furthermore, 
as will be more particularly appreciated hereinafter, it is desirable 'to" 
employ a packet format which accommodates information particular to 
the operation of the cascade. One example among many, is, as described 
in GB-2359692, the modification of packets to enable them to obey 
proper forwarding rules when ports on some but not necessarily all the 
units are members of a trunk, namely a multi-link connection from those 
ports to another unit or stack of units. 

An added difficulty arises when it is desired to form a cascade connection 
of units such that the connection can accommodate the addition of units 
into the cascade or the removal of units from the cascade or even the 
failure of one or more units in the cascade or failure or removal of links 
(such as cables) between units without requiring manual adaptation or 
reconfiguration and which allows the augmented or diminished or 
partially defective cascade to continue operation after only a momentary 
delav. The terms 'hot-swap' or 'hot insertion' are conveniently used to 



refer to the exchange or insertion of units which allows the cascade 
connectior to function immediatel}^ after the swapping or insertion 
process takes place. The term 'self-healing' may be used to indicate the 
nature of a cascade which can, despite the failure of a unit or link, 
5 continue to function in respect of the other units in the cascade. 

It is known from pubhshed GB patent application No. 2357390 to provide 
a limited form of self-healing, employing a single bidirectional cascade 
which can revert to a unidirectional cascade if a link or unit fails. That 
10 earlier proposal requires in its described form complex forwarding rules 

which are not easy to accommodate with other forwarding rules that the 
units may need to observe. The proposal does not indicate how hot 
insertion or hot swapping may be achieved in practice. 

15 In the 3Com switch type 4400 and as described in co-pending application 

No. 0018713.8 a cascade data path and a cascade control path can be 
maintained irrespective of the operational state of the network units in a 
stack by means of three-port connectors (known as T-pieces). These 
connectors have internal hardware processing circuits coupled to a 

20 'Down' port, an Up' port and a module port which is connected to a 

respective unit. The processing circuits can compute and convey 
identification numbering (i.e. TJnitlDs') and an active unit count by 
means of control frames sent and received at each of the three ports. 
However, these circuits and the associated multiplexing impose a 

25 considerable hardware processing overhead which increases (owing to the 

need for active clock and data recovery) with higher bit rates. 

Summary of the Invention 



30 



There are broadly three aspects to the achievement of a self-healing 
cascade which allows hot insertion or hot-swap. One is the organisation 
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of the data path so that it can accommodate insertion, removal or failure 
of units or links. A second aspect is the use of special headers so that 
packets on the data path can convey not only the ordinary address 
information characteristic of Ethernet packets as well as the payload 
5 (message data) and other parts of an Ethernet packet, but can also 

convey, within the system of the cascade, information which enables the 
switching engines of the units to operate as a single unit. A third aspect 
is the organisation and design of a control path by means of which 
information relating to the status or relative configuration of the units 
10 can be conveyed and automatically adjusted when units are added or 

subtracted operationally from the cascade. 

The present invention is particularly directed to the first two of these 
aspects. The third aspect, the control path, is a desirable feature of a 
15 system according to the present invention but has an independent utility 

and is the subject of our co-pending application entitled 'Cascade Control 
System for Network Units* filed on the same day as is the present 
application. 

20 As indicated previously, the main object of the present invention is to 

facilitate the provision of a self-healing hot-swappable cascade 
connection. Features of the invention include not only the physical and 
logical organisation of the cascade connection but also the organisation 
of a unit, so that it can obey the rules prescribed by the preferred 

25 organisation of the cascade connection, and a preferred signal format for 

use on the cascade. 

The present invention is based on the provision of a cascade connection 
in the form of a dual unidirectional connection so that, in its ordinary 
30 configuration, there is at least one and preferably more than one 

unidirectional ring for each direction -around the cascade, each ring 



including a respective port on each unit. For each ring, each port on a 
unit would be connected by a respective link to a corresponding port on 
the preceding unit and the following unit. 

The basis of this configuration is that in the event of a cascade 
operational failure (e.g. failure or removal of a unit or cable), the units on 
each side of the failure can maintain a cascade connection. A further 
object of the present invention is to provide mechanisms for such a 
healing process. 

The preferred form of the invention provides the aforementioned healing 
by causing the switching engines of the units immediately adjacent an 
operational failure in the stack to loop-back packets in one ring direction 
to egress the unit in the opposite ring direction. For the purpose the 
switching engines preferably coordinate information included in packet 
headers \A-ith status information which .may be conveyed by control 
frames. 

As will become apparent, each unit has at least two ports via which 
packets are forwarded onto the cascade and some means (such as 
address hashing) will be employed to provide an even distribution of 
forwarded packets among the ports. In order to avoid the possibility of 
endless loops by virtue of the combination of hashing and the action of 
the switching engines of units that perform loop-back/ a unit which is not 
performing loop-back performs in respect of some cascade ports a bypass 
operation m which packets received at a cascade port are sent out of the 
same port without being subject to the normal packet switching process. 

One aspect of the invention is the use of headers of packets sent on the 
cascade to assist the operation of the cascade. 
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A feature of the invention is the provision of a stack-wide port 
identification system, wherein a unique port ID comprises a portion (such 
as a six-bit binary number) identifying a port within a unit and a portion 
(such as a two-bit bmaxy number) identifying a unit. Accordingly when 
5 addresses are 'learned' by the units (i.e. made the subject of entries in 

the look-up database) they may be learned against a port identification 
which idejitifies both a unit and a port within that unit. The use of a 
stack- wide identification scheme enables the source unit (i.e. the unit 
which first puts a packet onto the cascade connection) to perform a 
10 complete look-up (if this is possible) of the forwarding data for a packet. 

Another feature of the invention is the use of a header segment, 
preferably in the form of a bit mask, which identifies those units which a 
given packet has traversed. This provides a means of discarding packets 
15 which have traversed every unit. 

As indicated above, the invention may be used in conjunction with an 
improved control path which comprises a virtual dual-ring path 
constituted by a single ring of half-duplex links across which control 

20 messages are repeatedly sent. Such a path can be 'healed' into a single 

virtual ring by sensing whether active valid control frames exist on a link 
and causing loop-back of control frames within a unit in the absence of 
such frames. The control data may include information such as the 
identity of a unit, a list of units physically present in the cascade and 

25 also a list (which may be different) of active units in the cascade. 

A further object of the present invention is to enable the cascade to be 
formed by means of links, such as cables, coupled between the cascade 
ports of the units m the cascade and being adapted preferably to convey 
30 ■ the control signals employed in the preferred embodiment of the 
invention. More particularly it is an object of the invention to achieve this 





-7- 



without requiring special multiplexer connectors, such a T-pieces, which 
would plug into special modules or ports on the units and which are 
organised to maintain a cascade path notwithstanding the failure or non- 



Further features of the invention will become apparent from the following 
description and the accompanying drawings. 



Brief Description of the Drawings 

10 

Figure 1 illustrates schematically a switch for use in the invention. 

Figures 2 and 3 illustrate the operation of a cascade connection in 
accordance with the invention. 



operational state of a unit to which a T-piece connector is coupled. 
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Figure 4 illustrates a packet including cascade status information. 



Figure 5 illustrates a cascade algorithm. 



20 



Figure 6 illustrates schematically a cascade card in a switch. 



Figure 7 is a schematic representation of cascade control logic. 



Figure 8 illustrates a link layer frame format. 



25 



Figure 9 illustrates frame traffic at a link layer. 



Figure 10 illustrates logic blocks for cascade control logic. 
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Figure 11 dlustrates control frame data. 
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Figure 12 is a state diagram. . 

Figure 13 illustrates an algorithm for unit identification numbers, 
5 Figure 14 illustrates an algorithm for the control logic. 

Figure 15 illustrates a register format. 
Figure 16 illustrates another register format. 

10 

Figure 17 illustrates another register format. 

Figure 18 illustrates a switch unit in accordance with the invention. 

15 . Figure 19 dlustrates vsa-ious register fields in a normal mode of operation. 

Figure 20 illustrates various register fields in a healed mode of operation. 

Figure 21 illustrates various register fields in another healed mode of 
20 operation. 

Detailed Description 

(a) General Description of a Switch 

25 

The reader is presumed to be generally familiar with the design 
construction and operation of network switches and routers intended for 
the handling and processing of address data packets, particularly 
according to Ethernet formats and protocols and procedures in 
30 accordance, for example, with IEEE Standard 802.3 dated December 

1998. However, for the sake of completeness, a brief and deliberately 
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simplified review of a network switch is given hereinafter for the sake of 
convenience. A more detailed description of the features of the switch 
relevant tc the present mvention will be given with reference to Figure 18. 

5 The swatch 1 shown in Figure 1 comprises a multiplicity of ordinary or 

Tront paner ports represented in the Figure by ports 2 to N. Very 
typically there would be twelve or perhaps twenty-four of these front 
pajiel ports, which are selectively connected to other devices such as 
hubs, switches, user terminals ajid suchlike typical of a communication 
1 0 network. 

The switch shown in Figure 1 also has four 'cascade' ports which will be 
connected in a cascade connection according to the invention. 

15 Associated with each port is a 'port ASIC (2a - 6a and Na) which 

provides, typically, for buffering of packets,.received at the respective port 
or to be forwarded (depending on whether it is forwarding or receiving a 
packet) from the port. The port ASIC typically performs some preliminary 
or final processing of a packet. The ports are usually capable of standard 

20 procedures such as 'auto-negotiation\ by means of which the port can 

establish VN^hether the device at the far end of the link to which the port is 
connected is active and whether it has a transmission capability 
compatible with the respective port. 

25 Although rhe port ASICs are shown as discrete m the Figure, in modern 

design practice the port ASICs together with most if not all the 
components of the switch may be realised in a single large ASIC, 
indicated by the chain lines 7. Alternatively it may be realised as 
described hereinafter. 
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Figure 1 illustrates a system of buses 8 by means of which packet data, 
address data from the packets, control data and suchlike are 
communicated around the various components of the switch. Again, the 
bus system is shown in deliberately simplified form. Very typically there 
are separate bus paths for the various t3^es of signals. One example of a 
modern switch layout is shown in our co-pending apphcation number 
01052891-3 filed 5 June 2001. 

Very typically, each switch has a controlling CPU 9 which governs the 
operation of the components of the switch. These include a packet 
memory 10 for the storage of packets between the times of reception and 
forwarding. Typically the switch includes a forwarding database 11 to 
which a look-up engine 12 has recourse and in accordance with which a 
switching engine 13 can derive from the look-up the address data and/ or 
other associated ^data re.quired for forwarding the packet to. its required 
destination. The switch includes a multiplicity of control and status 
registers 14. 

Normally the forwarding database stores addresses (which may be MAC 
addresses or network addresses) against the relevant forwarding data (i.e. 
port numbers). A look-up database is typically compiled by performing a 
look-up in respect of the source address of an incoming packet and 
making an entry in the database if the source address is not already held 
in the database. 

The addresses m the database may be organised m a variety of different 
forms according to the search engine or engines employed. Some types of 
database employ hashing of address data in order to reduce long address 
words intc shorter words for storing in a memory, though in such a case 
it is necessary to provide linked lists of entries which are hashed to the 
same address and to compare search results in turn to determine 




whether they correspond to the correct input data. Other forms of look- 
up include trie look-ups. 

Different forms of forwarding database and techniques for performing 
5 look-ups are well described in for example GB patents 2337674, 2337659 

and 2350534. 

When a look-up is performed on a destination address, the forwarding 
data is retrieved and used by the switching engine 13 to direct the 
10 relevant packet to the required port or ports. 

It should be well understood that if a packet is received at a 'front-panel' 
port of one of the units, it will be temporarily stored while a look-up 
based on the address data is performed. If the retrieved forwarding data 
15 indicates that the only local destination is a port on the same unit, the 

switching engine will direct the packet to that port. If a destination port is 
unknown or is known but on another unit, the packet will be forwarded 
onto the cascade. 

20 The foregoing is deliberately simplified and does not include such known 

operations as address resolution where addresses are unknown. It does 
not include well known features of processing which may affect whether a 
packet is forwarded from any given port. These aspects are, for example, 
^sarne port discard* rules, VLAN membership, spanning tree logic, 

25 trunking logic and suchlike. Such processing features are commonly 

included 121 the operation of a switching engine. Moreover, the foregoing 
description does not take into account the distribution of processing that 
may be adopted if, for example, the switch is composed of a multiplicity 
of modules connected by high-speed internal links. Reference may be 

30 made for example to GB patent apphcations 0102310.0 and 0011239.1 



for further description of distributed processing in switches of 
nature. 



If a packet is received which is destined for a device not connected to the 
unit by which the packet was received, the packet is sent, subject to 
various processing requirements, to the cascade. In the switch shown in 
the drawing there are four 'cascade' ports. In general, a packet may be 
sent to any one of those ports and whichever that might be can be 
determined by hashing (for example by means of exclusive-OR circuits) 
address data in the packet to a short word, such as a two-digit word 
which will identify uniquely the selected port. Hashing procedures of this 
general nature are known from, for example, GB application 2361830 
and may be included in the packet processing described above. As will be 
further apparent, it is necessary to include some means by which the 
packet. will be transmitted by subsequent units in a consistent direction 
around the cascade. 

For reasons which will soon be apparent each of the cascade ports has 
two 'connectors\ one for the Up' direction and one for the 'Down' 
direction, so that each port can be connected to the corresponding port 
in the next and previous units in the cascade. 

(b) Cascade Connection 

The cascade connection shown in Figure 2 is in essence a trunk 
connection made up of four unidirectional rings. Unit 0 shows the four 
cascade ports denoted A, B, C and D from left to right. The four rings in 
this specific example are each capable of a transmission rate of 2.5 
gigabit per second and are so connected that they connect respective 
ports on the various units. Thus ring RA connects the ports A on the four 
units and is composed of links each from the TX side of a port A to the 
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RX side of the corresponding port A of the next unit; ring RB connects 
the ports 13, ring RC connects the ports C and ring RD connects the ports 
D. The organisation is such that whereas ports A and B receive packets 
on the respective Unks from the ports A and B of the preceding unit on 
5 the cascade and forward by way of the respective link to the respective 

ports on ihe next unit on the cascade, the direction of receiving and 
forwarding for ports C and D is reversed. 

The Unks between the units are each point to point connections each of 
10 which forms an individual MAC domain. As will be seen, the individual 

links are almost standard Ethernet links with some additional signalling 
to enable rhe cascade to function. All packets will normally be subject to 
the switching engine of each unit. 

15 It should also be remarked that some means needs to be employed to 

ensure that where, for example, a given unit receives a packet on port A 
but needs to forward that packet further round the ring, that packet will 
be sent out on the same port A in a consistent direction. One method of 
achieving this is for all the units to employ the same hashing algorithm 

20 for cascade port selection on some packet segment (e.g. a network 

address) which will not vary as the packet progresses around the 
cascade. 

It may also be denoted at the present stage that the cascade is preferably 
25 source-routed. When a unit receives a packet on a front-panel port it will 

perform a full look-up for the destination unit and port even if the packet 
IS destined for another unit. The advantage of such a technique is that no 
look-up b^indwidth is required for cascade port ingress. Units need to 
learn packet source addresses (SA) against the ingress port and unit. If 
30 look-up tables are to remain current then all the units have to update 

their look-up tables whenever a unit is removed from the stack. This 
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may be achieved by way of the normal learning' process of the look-up 
databases in the various units. 

(c) Casc:ade Path Healing 

5 

Figure 3 illustrates a circumstance where Unit 2 has failed or is powered- 
down prer^aratory to removal. As described later, cascade control logic 
monitors the status of the cascade units and in particular whether there 
is communication on all the links in the cascade and reports the 
10 powering-down of Unit 2 to the other units on the cascade. As further 

described in more detail later, the switching engines, each under the 
control of the respective CPU, will take the following actions to heal the 
cascade. 

15 Unit 3 will loop back packets from ports C and D to ports A and B. More 

specifically/, this unit will process packets that are received on either port 
C or port D and retransmit those packets on ports A and B. Likewise, 
Unit 1 will loop back packets from ports A and B to its own ports C and 
D. It will process packets received on ports A and B and retransmit 

20 packets on ports C and D. 

However, Unit 0 (which is not adjacent the powered-down Unit 2) will 
operate differently. When it receives a packet on either port A or port B 
then any ])ackets destined for Unit 1 or Unit 3 will be retransmitted on 
25 those ports. Any packets received on ports C and D will be retransmitted 

on those ports without any processing. This action is called herein 
'bypassing' and can. be controlled by means of information which 
identifies where there is absence of communication between units. This 
information may be conveyed by the control frames described later. 



30 
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Unit 0 in this example must perform a bypass operation because 
otherwise packets destined for Unit 3 could be caught in an endless loop. 
For example, if Unit 1 transmitted out of port C (or D) towards Unit 0 a 
packet that would normally have gone to Unit 2 and would normally 
5 hash' to port A or B, this packet would be received on port C or port D of 

Unit 0. If Unit 0 were in a normal operational mode it would perform a 
hash to decide which port should transmit the packet. If the packet were 
transmitted out of port A or B then the packet would return to Unit 1. 
Then this Unit 1 would retransmit to the packet to Unit 0, thereby 
10 forming ari endless loop causing the packet to circulate continuously and 

never reach its proper destination. 

As noted previously, powering-down of a unit is not the only cascade 
communication failure which can be Tiealed'. If for example there is a 

15 link failure, which might even be an accidental removal of a cable, 

between two ports, a similar healing pjocess can be executed. For 
example, suppose there is a link failure between cascade port A of Unit 2 
and cascade port A of Unit 3. Then the packets which would normally 
leave cascade port A of Unit 2 in the Up direction can be looped-back to 

20 exit from port C or port D in the Down direction. In such a case both Unit 

1 and Unit 0 need to be in the bypass mode so that the packets reach 
Unit 3 by way of the cascade ports C (or D) of Unit 1, Unit 0 and Unit 3, 

(d) Cascade Status Information 

25 

The preferred cascade protocol requires that 32 bits of data be included 
with the Irame as it is sent internally in the stack from one unit to 
another. These 32 bits of data are sent in a header at the start of the 
frame. Frames sent on the cascade may be sent with a 64 bit inter- 
30 packet-gap to allow for this. The CRC of the frame may also cover the 

cascade header. 
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When a unit receives a packet, it will perform, if the packet is to egress 
from a port on the same unit, the usual packet processing functions, 
which need not be described here in detail. If the packet is to be sent on 
5 the cascade, it is provided with a header which includes certain 

information, called herein 'cascade information' that is used by a unit 
that receives the packet to determine (as will be described later), in 
conjunction with the setting of various registers, the appropriate 
processing for the packet received on the cascade. As will be explained 
10 later, the registers are controlled by means of control data sent around 

the cascade on a control path and are set in accordance with the 
numbering of the units, which of the units are active or not and other 
information to be outlined. 

15 Figure 4 illustrates in simplified form a packet 41 which is sent onto the 

cascade by a unit in accordance with the invention. The packet 41 has a 
start of frame (SOF) sequence 42, a ^cascade header' 43, a MAC address 
section 44, a network data section 45, apayload section 46, a CRC (cyclic 
redundancy code] section 47 and ah end of frame (EOF) section 48. 

20 

The section 43 will be described below. It comprises the 32 bits of 
cascade information mentioned earlier. The information is inserted 
during the 'processing' of a packet by the switching engine. 

25 The section between 43 and 44 may be occupied by some internal control 

sequence :o delimit the start of the section 44, which is the layer 2 or 
MAC add-ess section occupied by a destination address (DA) and a 
source address (SA), These are in conventional 48-bit form including the 
usual bits to indicate whether the packet is unicast, broadcast or 

30 multicast. 
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Section 4 5, entitled 'Network' is intended to signify network (IP) 
addresses. VLAN data and other parts which are of no great significance 
to the present invention. 

Section 46 is the message part or payload of the packet, consisting of a 
multiplicity of bytes of user data. 

Section 47 is the CRC or cychc redundancy check portion which is 
normally (apart from the end of frame sequence 48) computed on all or 
part of the frame and is used to determine whether a frame has been 
received without error. The CRC may be (in known manner) computed for 
the packet including the header 43. 

Cascade Header 

The special 'cascade header' 43 for the present invention includes seven 
fields as explained below. 

The first cascade header field (i), denoted 'SrcPortID[8:0], is the source 
port number in the stack that receives the frame. 

This identification number and the destination port identification 
(DestPID) conform to a stack-wide format, wherein a portion of the 
identification number, and particularly the (least significant) bits [6:0] 
represents port numbering within a unit and another portion, 
particularly the (more significant) bits [8:7], represents the particular 
unit. Thus the cascade system or stack can in this example 
accommodate up to four units each having up to 128 ports. The format 
may be modified to accommodate more units. If eight units were the 




intended limit of the stack the TJnitID' portion of the ID field could be 
augmented to three bits. 

One advantage of such a format is that in most instances the source unit 
5 (which is ihe first unit m the stack to receive a packet from the external 

network) can perform a complete look-up for the destination port. Source 
port IDs for previously unknown addresses may be learnt in all the units 
to facilitate the performance of destination look-up in the source unit. 

10 The second field in the cascade header is denoted SrcTrunkID[4:0]. This 

is the trunk number of the source port that receives the frame, if the 
source port be a member of a trunk. 

This field is for the (known) purpose of identifying a trunk connection to a 

15 multiplicity of ports which may be distributed among the units in the 

stack, the trunk consisting of a multiplicity of essentially parallel links 
from those ports to a remote entity which may be a single switch but 
might be another stack of switches. The purpose of a trunk is to increase 
the bandv/idth available for transmissions from one unit or entity to 

20 another. The connection of trunks to switches in a stack produces some 

complexity- in the manner in which packets are forwarded on the cascade 
and although trunking and the difficulties of coping with it are not part of 
the present invention it needs to be said that the logic required for 
dealing with stack wide trunks preferably responds to an identification of 

25 a particular trunk so that units which receive packets by way of the 

cascade can determine whether such packets come from a trunk 
connection ajid, if so, which trunk connection. Examples of the handling 
of a stack wide trunk and the various modifications which need to be 
made to such rules as the 'same port discard' rule (in accordance with 

30 IEEE 802. 1) are set out m published GB patent applications GB-2359692 

and GB-2361830. 
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The third iield in the cascade header is a one-bit field denoted 'DPK*. This 
is merely an acronym for 'destination port known'. If this bit in the 
header is set, the bit indicates that the destination port for the frame is 
also carried in the cascade status header (see below). This bit enables the 
forwarding of such a frame when it is received by the unit having that 
destination port, without another look-up. 

The fourth field in the cascade header is a single bit field termed herein 
'unknown DA' i.e. 'unknown destination address'. When this bit is set, it 
indicates that the destination MAC address (DA) in a packet received (by 
way of a front panel port) by a unit in the cascade is not found in the 
look-up database of that switch. Each unit in the stack will treat the 
destination address as an unknown entry. This applies to both unicast 
and multicast addresses. 

The fifth field of the cascade header is the destination port identification 
'DestPID[8:0]'. This field conforms to the format discussed in relation to 
the source identification and uniquely identifies a destination port and 
the relevant unit in the stack for the frame if the 'destination port known' 
field is set. If this latter field is cleared, the 'destination port ID' field is 
ignored. 

The sixth neld in the cascade header is the box bit mask field, BBM[3:0]. 
This field, which is obtained as described hereinafter from a 'units 
present' register in a respective source unit, indicates by the setting of 
the respective bit the units which the relevant packet has already visited. 
Thus, if the bits indicate, from left to right, the Units 3 to 0 respectively, 
and the packet is received at Unit 1, then the box bit mask for the packet 
as it is put on the cascade by Unit 1 is 1101, that is to say all the units 
less the source unit. Unless an ingress port is in a bypjass mode, as more 
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. particularly described later, a unit will, as part of • the processing 
performed on the packet, clear the respective bit in the box bit mask. 
Reference is made below to Figure 5 for a fuller description of how the 

box bit mask is handled. 

5 

The final field in the cascade header is a *drop precedence' field 
constituted by a single bit. This bit is derived from the rules engine in the 
(source) unit in which the packet is first received. It is used at egress 
ports to de termine the drop threshold for the egress or transmit queue. If 
10 the bit is set, the lower drop threshold is used to determine if the packet 

is to be added to the queue. Otherwise the 'queue full' threshold is used 
when addmg the packet. In essence the drop precedence field may be 
employed :o give certain packets a higher priority within the stack. 

15 Box Bit Mask 

Figure 5 illustrates the manner in which the box bit mask field (BBM) is 
emploj^ed. This bit mask field does not affect the manner in which known 
unicast packets are normally handled. These packets, for which the 
20 destination port will be known, will be removed from the cascade ring by 

the destination unit. However, in the case of a change to the stack, for 
example reconfiguration which may cause the destination unit to be 
removed, the bit mask is required. It is also required to prevent 
recirculation of a packet. 

25 

As is shown in Figure 5, the packet is received at a unit from the 
cascade. This is stage 501. Stage 502 is a test to determine whether the 
unit IS in a cascade 'bypass' mode. If it is, then no processing is 

performed on the packet, as indicated by the 'do nothing' stage 503. The 
30 packet will be forwarded from the same port as received the packet, as 

described with reference to Figure 3 and later with reference to Figure 20. 




Stage 504 is a test to determine whether the relevant box bit mask bit for 
the unit is set. If the bit is not set, then as shown by stage 505 the packet 
must be removed because it has traversed the ring already. 

5 

If the respective box bit mask bit is set, then a test (stage 506) for 
whether the destination port is known for the packet determines, if it be 
not known, a look-up (stage 507). If the destination look-up obtains a 
destination port on this unit, the packet will be forwarded from the 

10 respective local port. Otherwise the packet is destined for the cascade. If 

the destination port is known, there is a test, stage 511, to see whether 
the destination port is on this unit. This requires only a simple bit match 
rather than a full look-up\ If the destination port is on the respective 
unit, the packet is sent to the local port (stage 512) and removed from the 

15 ring. If the destination port is not on the respective unit, the box bit 

mask bit ;s cleared (stage 508). If the bit. mask bits are all zero (stage 
509), then the packet must be removed (stage 510) because there are no 
more destination units. If there is one or more set bits remaining in the 
box bit mask, the packet may be sent to the ring (stage 513). 

20 

Cascade Control Logic 

The cascade which has been described in the foregoing requires in 
practice, in addition to the status information carried in the header of the 

25 packets, some control information which is passed between the units and 

which will enable them to be configured or reconfigured in a manner 
which will enable them to redirect packets and, in the case of a TDypass' 
mode, to perform no redirection, in accordance with the status of the 
various units withm the stack and particularly in respect of the *self- 

30 healing' 0])eration that has been described. Different forms of control 

may be employed but it is convenient and generally preferable to employ 
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the contj-ol mechanisms described hereinafter. These control 
mechanisras are the subject of our co-pending application of even date 
herewith l)ut are described here in full in order to provide a complete 
description of a cascade system. 

5 

The preferred form of cascade control is, as described in the following, an 
active system wherein in normal operation the units transmit various 
status parameters, which enable the units to compute their identification 
number (called herein 'unit ID'), to determine a list of units which are 
10 present in the cascade, and a list of which units are powered units within 

the cascade, and preferably also to inform the respective CPU of a variety 
of faults in the cascade. The preferred form of the cascade control is 
capable of performing self-healing in the event of removal of a unit or 
cable. 

As described earlier, known cascade units which can tolerate hot 
insertion and removal, and provide a degree of self healing, employ T- 
pieces which can maintain a data path and a control path around the 
cascade notwithstanding the powering-down or removal of one of the 

20 units. It :s advantageous in the context of the present invention to 

provide a control path which does not require connectors with internal 
multiplexers and which can provide for 'self-healing' in the event of 
removal of one of the units or cables in the cascade. The configuration 
employed for the control path is a chain of bi-directional, half duplex' 

25 links, each link extending from one unit to the next m a daisy chain of 

the units .around the control path. On each link, in normal operation, 
the respective units will exchange information in a time shared manner. 
This information is preferably exchanged continually and is derived from 
registers within the cascade control logic in each unit. 
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Figure 6 illustrates schematically the disposition of the cascade control 
logic within one of the switch units previously described. 

The switch unit 61 shown m Figure 6 operates m a manner similar to 
5 that functionally described with reference to Figure 1. In this 

embodiment, it is physically organised somewhat differently to the 
apparent ]:>hysical arrangement in Figure 1, in that it is composed of a 
group 62 of four switch modules 63, each of which is a multiple port 
module. These switch modules communicate between themselves by 

10 means of high speed links in the manner described in, for example, GB 

patent applications 0102310.0 and 0011239.1. However, the internal 
organisation of the switch module 62 is not particularly relevant to the 
present invention and it is sufficient to indicate that the modules operate 
as a single switch in respect of the combined set of ports that the 

15 modules have and are organised to forward onto the cascade or receive 

from the cascade packets on line 64 which are coupled to the four 
cascade ports previously described. 

The cascade control logic is designed to monitor the cascade and to 
20 provide the following functions: (i) to provide the respective unit with a 

Unit ID; (ii) to provide the respective unit with a 'List of Units Present'; 
(iii) to pro\-ide the respective unit ^vith a *List of Powered Units'; and (iv) to 
inform the CPU of cascade faults 

25 The Cascade Control Logic (CCL) 66 may be implemented as an FPGA 

(field programmable gate array) that is accommodated on a cascade 
downlink card 65. The interface between the CCL 66 and CPU 9 is a 
Serial Maiiagement Interface (SMI) 68 as defined by IEEE 802.3-1998 
Clause 22. The CPU is able to interrogate registers internal to the CCL. 

30 The registers contain information such as the Unit ID, the List of Active 
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Units and the cascade link status. The CCL is able to inform the CPU of 
sun urgent event via an interrupt signal 69. 

The links 67 denoted Up_Control and Down_Control are bi-directional 
5 half-duplex serial Unks. The Up_Control signal path of a unit is 

connected to the Down_Control signal path of the next unit in the stack, 
m a daisy-chain fashion. Signalling on the bi-directional links is 
controlled by a master-slave relationship, the CCL generating the 
Up_Control signal being the master. Transmission of frames alternates 
10 between the master and slave, as described with reference to Figure 9. 

As is particularly shown in Figure 7, the cascade control logic 66 receives 
a signal, denoted Unit_Power_Sense' in accordance with the state of 
energisation of the particular unit. It exchanges information by way of 
15 the SMI Bus 68 with the central processing unit (CPU) and can provide 

interrupt signals on an interrupt line 69. 

The cascade control logic 66 transmits control frames to, and receives 
control frames from, the cascade control logic 66a for the next unit by 

20 way of a line denoted Up.Control', which is a single line coupled to 

ground by a pull-down resistor 71. The cascade control logic 66 receives 
control frames from, and transmits control frames to, the cascade control 
logic 66c in the unit next below it by way of a Down_Control line having 
pull-down resistor 72. Figure 7 shows the completion of the daisy chain 

25 by CCL 66b and 66c, each of the cascade control logic units (66-66c) 

being connected by way of its Up_Control line to the Down_Control line of 
the control unit next 'above' it in the chain and also being connected by 
way of its ^Down__Contror line to the Up_Contror line of the cascade 
control logic m the unit next 'below it' in the chain. 
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The daisy-chain of bi-directional half-duplex links forms a complete ring. 
In normal operation this creates two virtual rings: one rotating clockwise 
and bemg the transmission direction for 'master' control fraines, the 
other anti -clockwise, being the transmision direction for 'slave' control 
5 frames. If there is a break in the cascade wiring the control logic, and 

particularly its transport layer as described with reference to Figure 10, 
can 1oop-])ack' frames. This allows the cascade control path to lieal' in 
the same manner as the cascade data path. 

10 The cascade control logic derives its power from a shared cascade VCC 

power line 73. This means that the CCL of a powered-off unit is still able 
to participate in Unit-ID numbering. The input Unit_Power_Sense 
indicates to the CCL whether the unit is powered up. 

15 Although there is a variety of ways in which the units can be connected, 

it is convenient to employ between succj^ssive units a common cable 
which has a plurality of paths (in this example four) for data packets, a 
single con-rol path and a single VCC line. The data paths may be twisted 
pairs or optical fibres. Each cascade port may have two multi-pin 

20 terminals, one each for the Up and Down directions. 

Cascade Control Signals 

The control frames shown in Figures 8 and 9 are transmitted and 
25 received by the link layers which will be described with reference to 

Figure 10. 

Figure 8 illustrates at 81 the waveform of the cascade control signals and 
at 82 the significance of each of the parts of the waveform. 

30 
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The preamble of each control frame consists of the sequence 10101010. 
The chips (channel bits) of the preamble are Ti>iT/2 (lus) m length, half 
the length of the normal data bits. Hence the preamble constitutes a 
unique setiuence not found in the control data. The preamble is followed 
5 by 4 data bytes (each with an odd parity bit). The data bits are Tbit (2us) 

in length. In Figure 8, 'BO, DO' represents the first digit (bit) m the first 
byte and so on. The line is driven low for 2us at the end of transmission, 
denoted by the 'Idle Low^ (00) segment after which it is tri-stated (denoted 
by "Z^). The duration of each frame is therefore 82us. 

10 

Control data is transferred between units in a half-duplex fashion. The 
CCL is a 'master' on Up_Control and a 'slave' on Down_Control. The 
'master' transmits a control frame (shown as the ^master frame^ 91 in 
Figure 9 every Tfmme (200us). This is followed by an interval 92 of four 
15 times the bit period. Then the 'slave' transmits a control frame 93 after 

receiving the master's control frame. The end of the frame is defined as 
being the end of the 2us 'idle low' interval. An idle interval 94 lasts until 
the end of the control cycle period (Tfi-ame]. 

20 Control Link Layer 

Figure 10 illustrates in more detail the cascade control logic 66. It 
broadly comprises two link layers, link layer 10 lU, connected to the 
unit's 'Up._Control' link and link layer lOlD, connected to the unit's 

25 *Down_Contror link. The link layers transmit and receive 4-byte control 

frames between the adjacent units. They also indicate, by responding to 
the presence or absence of valid control frames, the status of the 
Up_Control and Down control links by means of the signals CTRL_OK_UP 
and CTRL_OK_DOWN respectively. Link layer 10 ID indicates the idle 

30 state of th-? Down^Control link on a line denoted 'RESILIENT'. 
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Between tlie link layers is a transport layer 102 and a Field Update Block 
103. The transport layer includes a first multiplexer 104 which is 
governed l)y the 'Control_OK-Down' signal. A second multiplexer 105 is 
controlled by the control OK-Up signal. Multiplexer 104 will pass either 
5 the Down Rx-Data obtained from the link layer 10 ID or the 

Down_Tx_Data which is supplied to the link layer 10 ID from a register 
set 106 denoted 'Up_Regs'. Multiplexer 105 will pass to the registers 106 
either the Up_Rx_Data received from the link layer 10 lU or the 
Up_Tx_Dava which is supplied to the link layer 10 lU from the Field 

10 Update Biock. The Field Update Block receives the output of the 

multiplexer 104. It includes a register set 107 denoted T)own_Regs', a 
processing function 108, and a register set 109 denoted *My-Regs'. It 
provides an output by way of the transport layer 102 to an input of the 
multiplexer 105 and to the Up' link layer lOlU. Thus Up^Tx„Data is 

15 supplied to the Up_Control and, controUably, to the register set 106. 

The Transport Layer 102 provides the 4-byte data content of transmitted 
control frfimes. The Link Layer returns the data content of received 
frames to the transport layer. The Link Layer discards received frames 
20 that contain parity errors. 

The link layer 10 ID reads the idle state of the Down.Control line Tbit/2 
(lus) after the 'idle low' following transmission of the slave frame. The. 
result is output on RESILIENT (see Figure 10). If a normal cable is 
25 attached io Down_Control the weak pull-down on the line will ensure 

that the F.ESILENT is low (i.e. denotes FALSE). A resilient cable differs 
from a normal cable m that it has a strong (IK) pull-up on its control 
signal lines- This pull-up will ensure that RESILIENT returns TRUE if a 
resilient cable is attached to Down_Control. 
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At time T. prune after reset and every Tupdate thereafter CTRL_OK_UP and 
CTRL_OK_DOWN are evaluated. CTRL^OK.UP is set to TRUE if at least 
one frame has been received on Up_Control in the preceding Tupdate 
(otherwise it is set to FALSE). Similarly, CTRL_OK_DOWN is set to TRUE 
5 if at least one frame has been received on Down_Control m the preceding 

Tuixlatf. 

Cascade Control Transport Layer 

10 The purpose of the transport layers is to circulate a 4 b^^te data field 

through tlie cascade control logic of all units of the stack. The data is 
first generated by the Field Update Block (FUB) of the 'bottom' stack unit 
and circulates through the FUB of each stack unit, eventually returning 
to the boTtom unit. It is essential that the order of data circulation 

15 through the FUBs is preserved, regardless of whether the cascade control 

signal path is healed or not. 

Under normal operation (CTRL_OK_UP == TRUE and CTRL.OK^DOWN 
== TRUE) data from control frames received on Up_Control is written into 

20 the register set Up_Regs' 106. The Up.Regs register set is used as the 

source of data for frames transmitted on the Down_Control line. Data 
from frames received on the Down_Control line is written into the 
Down^Regs register set 107. This data is parsed and modified by the 
processing function 108 FUB before being written to the My_Regs' 109 

25 register set. This set 109 is used as the source of data for frames 

transmitted on the Up_Control line. 

If either of the signals CTRL_OK_UP and CTRL_OK_DOWN is FALSE then 
the control data paths are looped-back within the transport layer so as 
30 not to send the control data on the relevant control link. This effectively 

heals the control frame data path so that data always passes through the 
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FUB of each unit in the same order. There is not necessarily a one-to-one 
relationsh.p between frames received on Down_Control and those 
transmitted on Up„Control (and vice-versa) since each port may have 
different master clocks which may differ slightly in frequency. 

5 

Consider again the stack labelled ^Normal Operation' in Figure 2. Data 
generated by the FUB m Unit-0 will circulate (in a clockwise direction) 
through -the FUB m Unit-1, Unit-2, Unit-3 and will then received by Unit- 
0 on Down_ControL Data will also circulate anti-clockwise through the 

10 Up_Regs in each unit. Since this data is not processed by the FUB it is 

meaningless. If the cable between Unit-1 and Unit-2 is disconnected, 
data generated by the FUB in Unit-0 will be written into the FUB of Unit- 
1. Since CTRL^OK_UP in Unit-1 is FALSE, the FUB data output 
(Up_Tx_Data) is looped-back (via Up_Regs) and sent out on 

15 Down_Control. This data is received on Up_ControI on Unit-0 and is 

written into Up_Regs. The data is then re-sent out on Down_Control and 
is received by Up_Control on Unit-3. Unit-3 also transmits the data out of 
Down_Control without processing it. Unit-2 receives the data on 
Up_Contrcl. Since CTRL_OK_DOWN on Unit-2 is FALSE, this data is 

20 looped-back into the FUB (via Up_Regs). The FUB processes the data and 

sends it out of Up^ControL The data is processed by the FUB in Unit-3 
and is then sent out of Up_Control to Unit-0. The path of the data 
through tiie FUBs is Unit-0, Unit-1, Unit-2, Unit-3 and then back to 
Unit-0 as in normal operation, 

25 

Control Frame Data 

The control frame data transmitted and received by the FUB 103 has the 
format shown m Figure 11. An 'idle' frame has IDLE set to 1'. This 
30 indicates that only the first byte of the frame data is valid. 'Active' frames 

have IDLE set to '0' and all the frame data is valid. HEAL^REQ and HEAL 
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are used to indicate that the cascade data path must be healed by the 
CPU. RENUM_REQ and RENUM force the FUB to renumber all units in 
the stack MISCONFIG_REQ and MISCONFIG are used to signal a 
cascade cabling mis~configuration. CURRENT_CTRL is a list (in the form 
5 of a bit mask) of all units present m the stack; NEXT_CTRL is a 

temporary variable used m its calculation. CURRENT_POWER is a list 
(bit mask) of all units that are powered-up; NEXT_POWER is a temporary 
variable used in its calculation. 

10 Primary & Secondary Status 

The FUB 103 must decide whether the unit is a 'primary' or a 'secondary' 
unit. There should only be one primary unit in a stack aad it processes 
control data in a different manner than secondary units. The primary 
15 unit initiates the transmission of 'active' control frames around the stack. 

If a primaiy unit does not initiate the transmission of active frames then 
secondary units will transmit 'idle' frames. 

Figure 12 shows how the FUB evaluates whether the respective unit is a 
20 primary or secondary unit. The signal ACTIVE_FRAMES is a signal 

internal to the FUB that is evaluated every Tuixiare. ACTIVE_FRAMES is set 
to TRUE if active frame data has been received by FUB at least once in 
the preceding Tupdare (otherwise it is set to FALSE). 

25 If there is a resilient cable plugged into the unit's Down^Control then 

RESILENT==TRUE and the unit becomes a primary (at time Ti after 
reset). 

If there is ao resilient cable between top and bottom units, a unit will not 
30 receive any active frames and it will become (at time T2 or later) the 




primary if Down_Control is not connected to another unit 
(CTRL_OK_DOWN==FALSE). 

If the resiUent cable is replaced with a normal cable (i.e. all cascade 
5 cables are normal cables), there will be no unit with 

CTRL_OK_DOWN==FALSE and all units will remain secondary units. If 
the FUB of a secondary unit has received no active frames by time T3 
then It will set the MISCONFIG_REQ bit in its SMI register. The change in 
state of the SMI register MISCONFIG_REQ will also cause an interrupt to 
10 alert the CPU to the error condition. 

If there is more than one resilient cable in the stack, most, but not all 
error conditions could be detected if the presence of a resilient cable on 
both Up_Control and Down_Control were checked. This mechanism fails 

15 in the case of a four-high stack with resilient cables between the top and 

bottom units and between the middle two units. The CPU can detect this 
combination and all others so it is preferable to leave checking for two 
resilient cables as a CPU task. For example, a stack management agent 
(8 MA) in a primary unit could broadcast a special packet containing the 

20 unit's mac: address. If a primary unit's SMA received such a packet with 

an IP addiess not its own, then it would know that there are more than 
two primar y units in the stack. 

Unit-ID Numbers 

25 

The purposes of unit numbering are listed below in descending order of 
priority. Tlie order of priority means that it is more important that units 
have ascending numbers than that that they retain their existing Unit-ID. 
So, if a unit is added to the bottom of a stack it is likely that units above 
30 it will need renumbering. 
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Each stack unit has a unique Unit-ID. It is desirable to provide newly 
powered-up units with a sequential Unit-ID (so that Unit- IDs increase in 
3X1 ascending order from Unit-0 at the bottom of the stack). Jumps in the 
ascending order of unit numbers are acceptable, 

5 

Existing powered-up units retain their Unit-ID (provided this does not 
lead to non-sequential unit numbers). This minimises modification of 
lookup tables. 

10 The algorithm that achieves these goals is expressed in Verilog code in 

Figure 13. In Figure 13 'my_regs_UNIT_ID* refers to the parameter 
UNIT_ID in the 'My_Regs' registers. Similarly 'down_regsJDLE' refers to 
the param.eter IDLE in the ^Down_Regs' registers. The algorithm also 
deals with the error condition of the FUB receiving a frame with Unit-ID = 

15 3. This could occur in a four-unit stack where the bottom three units had 

Unit-IDs 0, 1 and 3. The fourth unit will" transmit -frames- with- 
RENUM^REQ set. The primary unit will receive this and transmit frames 
with RENUM set, causing all units in the stack to renumber to sequential 
Unit-IDs. The algorithm also checks for the condition of more than four 

20 units in tlie stack - in which case the fifth unit transmits frames with 

MISCONFIG_REQ set. The primary unit will copy this bit into the 
MISCONFjG bit of its frames, ensuring that all units are aware of the 
stack mis-configuration. 

25 

List of Powered Units and List of Units Present 

A list (bit mask) of powered units (CURRENT_POWER[3:0]) is needed for 
the following reasons: 

30 

(a) To allow the CPU to purge the lookup table of entries relating to a 
non-powered unit. 



(b) To cUlow the CPU to configure the switch so that only packets 
destined fcr a powered unit are forwarded on the cascade. 

The list of units present (CURRENT_CTRL[3:0]) is the Ust of units 
participating in the control signal path. Since the cascade powers the 
CCL, the list of units present will also include units that are powered-off 
This list is necessar}^ to detect the condition of more than four units in a 
stack. 

CURRENT„POWER[3:0] and CURRENT_CTRL[3:0] are compiled using the 
algorithm in Figure 14. The algorithm makes use of the temporary 
variables NEXT_POWER[3:0] and NEXT_CTRL[3:0]. The primary unit 
initialises NEXT_POWER and NEXT.CTRL setting only the bit 
corresponding to its Unit-ID. Secondary units receiving this data set their 
bits within these fields. When this data^. returns to the primary unit 
NEXT_POWER and NEXT_CTRL form a complete list of the powered units 
and units present. The primary unit copies these fields across into 
CURRENT_POWER and CURRENT_CTRL. 

The CPU is able to access the latest version of CURRENT^POWER via an 
SMI-bus accessible register. This information is used as a forwarding- 
mask for the switches such that only packets destined for active units 

are forwarded in the cascade data path. 

Request Data Path Healing 

All units m a stack will take action to heal the data path. If the reason for 
data path healing is a powered-down unit then all units will be aware of 
this since CURRENT_POWER and CURRENT_CTRL will differ. In the case 
of a missing cascade cable, the problem will be visible to adjacent units 
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since either CTRL_OK_UP or CTRL^OK_DOWN will be set to FALSE. The 
FUB in these units should set the bit HEAL„REQ in active frames that 
they transmit. The FUB in the primary unit will copy this bit to HEAL, 
ensuring ihat all units are aware of the need to heal the cascade data 
5 path. 

Interrupts 

The Interrupt line to the CPU is asserted whenever one of the following 
10 SMI register values change: CURRENT^POWER, UNITJD, RENUM, 

MISCONFlG_REQ, MISCONFIG or HEAL. The mterrupt stays asserted 
until reset by writing to a SMI register. Interrupts are initially disabled at 
power-up and are enabled by writing to a SMI register. 

15 Time Sequence after Power -Up or Reset 

In a first phase, 0 to Ti. the Link Layer will transmit and receive frames 
between adjacent units. Since there is no primary, these will be idle 
frames. The Link Layer will evaluate CTRL.OK^UP and CTRL_OK_DOWN 
20 every TuiKU-re. The Transport Layer will loop-back the control data path if 

necessary. Ti is approximately 10ms - enough time for the Link Layers to 
send out frames and the Transport Layer to have healed the control data 
path (by performing any necessary loop-backs), 

25 In a second phase, Ti to Ti, the FUB in each unit will determine primary 

or secondary status. If there is a resilient cable within the stack then the 
primary will begin to transmit active frames and unit numbering will 
proceed. Complete control frame data will be computed within 2ms. This 
is the time taken for control data to circulate twice around the stack. [It 

30 takes 82us for Umt-0 to traasmit a frame to Unit-1. It takes up to 282us 

(Tfmme + duration of frame) to transmit a frame with this data from Unit-1 
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to Unit-2 (since Unit-2 may have just started transmitting a frame with 
old data). It also takes up to 282us to transmit a frame with this data 
from Unit-2 to Unit-3 and from Unit-3 to Unit-0. The total time to 
circulate control data around a stack is thus 3*Tframe + 4* (duration of 
5 frame) = 928us. Time T2 is approximately 100ms - enough time for the 

active frames to have reached all units, if there is a primary present. 

If a third phase, T2 to T3 is reached, no active frames are being received, 
the FUB ia each unit will re-evaJuate primary and secondary status. If 

10 there is a unit will CTRL_OK_DOWN==FALSE then it will become the 

primary and it will begin to transmit active frames. Unit numbering will 
proceed and complete control frame data will be computed within 2ms. T3 
is approximately 200ms - enough time for the active frames to have 
reached all units, if there is a primary present. If no primary is present, 

15 there must be a stack-cabling mis-configuration. 

CCL Operation ExamplesConsider the stack of four units labelled as 
'Normal Operation' in Figure 2. Assume that the units are first powered 
up simultaneously. The Link Layers in each unit will exchange idle 
20 frames every Ttvame- At time Tupciate CTRL_OK_UP, CTRL_OK_DOWN will be 

evaluated as TRUE. As a result the Transport Layers in each unit's CCL 
will set up the control data paths without loop-back. 

At time Ti the FUB in each unit will evaluate its primary/ secondary 
25 status. The bottom unit will become a primary and Unit-0 (since it has a 

resihent cable connected to Down_Control). The FUBs in other units will 
identify themselves as secondary units. The primary unit will start to 
transmit active frames on its Up^Control line containing Unit-ID=0, the 
signal CURRENT_POWER=4'bOOOO, the signal NEXT_POWER-4T30001 , 
30 CURRENT_CTRL=4b0000 and NEXT_CTRL=4Td0001. The next unit up 

the stack will receive these values as part of frames on Down_Control. It 



set its Unit-ID = 1, NEXT_POWER=4'b001 1 and NEXT^CTRL=4'b001 1 (in 
My_Regs) and will transmit frames with this data on Up_Control. The 
data will progress up the stack with the next units taking Unit-ID 's 2 and 
3. When tlie bottom unit (Unit-0) receives frames containing this data on 
Down_Cor.trol, it will copy NEXT_POWER (4T31111) to CURRENT_POWER 
and NEXT_CTRL (4hllll) to CURRENT^CTRL indicating 4 powered-up 
units in the stack. 

To provide an example of an operational failure, suppose that a network 
engineer trips over a power cable and powers-down Unit-2. This is the 
configuration shown as 'Healed Ring' in Figure 3. Since the CCL logic is 
powered by the cascade the control signal path is unaffected and control 
frames continue to circulate. However the Unit_Power_Sense line on the 
input to the CCL on Unit-2 will go low. The FUB in Unit-2 will set its bit 
in NEXT_POWER to '0'. The frame field CURRENT^POWER will change in 
all units to 413101 1, indicating the lack of power to Umr-2. The CCL in all 
units will assert Interrupt to indicate the change in the cascade status. 
Units 1 and 3 will heal the cascade data path by looping-back the 
cascade data path. All units will update their forwarding tables to remove 
entries for Unit-2. The use of CURRENT_POWER to mask transmission of 
frames onto the cascade will quickly remove cascade packets destined for 
Unit-2. 

The network engineer will eventually notice the problem and will plug the 
power cable back into Unit-2, The CCL all units will identify that 
Up_Control and Down_Control are now active and that power has been 
returned to Unit-2. CURRENT^POWER will be changed to 4Td1111 and 
CURRENT _CTRL to 4^)1111. The CCLs on all units will interrupt their 
CPU to alert them to re-read the CCL registers and update their 
forwarding tables. The state of the stack will quickly be healed into the 
topology labelled 'Normal Operation' in Figure 2. 
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Timing Intervals 

Table 1 is a summary of the timing periods employed in the preferred 
5 embodimeat. 



Table 1 



Symbol 


Description 


Value 


Units 


Tint 


Control frame bit period. 


2 


us 


Tfinme 


Time between start of transmission of 

each master control frame. 


200 


us 


TnixIriT<» 


Time between updates of CTRL_OK_UP, 
CTRL OK DOWN, and ACTIVE_FRAMES. 


1 


ms 


Ti 


Time after reset at which FUB evaluates 
primary status based on presence of 
resilience cable. 


10 


ms 


T_> 


Time after reset at which FUB evaluates 
primary status based on CTRL^OK_DOWN 
and absence of active frames. 


100 


ms 


Ta 


Time after reset at which FUB evaluates a 
stack-cabling mis-configuration based on* 
the absence of active frames. 


200 


ms 



10 

Control Logic Registers 

Figure 15 to 17 illustrate the allocation of space in each of the registers 
in the cascade control logic. 

15 

Figure 15 shows the register format for the Up_Regs' and T)own„Regs'. 
Each of diese registers is a four byte register. Bit 0 of byte 0 if set 
indicates an idle state. The second byte, byte 1, has byles indicating 
heahng, a heal request, renumbering, a renumbering request, a 
20 misconfigviration and a misconfiguration request. Bits 7 and 8 are 

reserved. The first 4 bits of byte 2 comprises a four bit word indicating 




the next control. Bytes 4 to 7 of the second byte is a four bit word 
indicating a current control. The last byte, byte 3, is composed of two 4 
byte words denoting 'next power' and 'current power'. 

5 Figure 16 illustrates the register format for 'My_Regs' in the cascade 

control logic. This is similar to the format described in Figure 15 but the 
register has an additional hyte of which the bytes 0 and 1 denote the unit 
identification, bit 2 indicates CTRL_OK_DOWN', bit 3 denotes 
^CTRL_OK_UP\ bit 4 denotes ^primary', bit 5 denotes 'resilience', bit 6 
10 denotes 'My_Power' and bit 7 denotes *Active_Frames*. 

Figure 17 illustrates the registers in the SMI memory map. The first five 
bytes correspond to the contents of the register described in Figure 16. 
The ninth byte indicates the module identification number. The bits m 
15 the other bytes are reserved. 

Cascade Operation 

The following description provides examples of the usage of the cascade 
20 status information in the operation of the cascade shown in Figures 2 

and 3. It also shows how the cascade control logic detects and 
distinguishes between the various kinds of operational failure and how 
theat logic and in particular the registers constitute with the CPU a 
means of responding to the status information represented by the control 
25 frames (or their absence) to control the switching engine. Figure 18 shows 

a simplified view (based on Figure 1) of the switch architecture. The four 
switch ASICs 63 of Figure 6 have been represented as one block (as in 
Figure 1) and their internal architecture simplified. 

30 Figure 18, shows within the ASIC 7 a bus system 8, a forwarding 

database 11, a look-up engine 12, a switching engine 13 and registers 14 
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as previously described with reference to Figure 1. The memory space is 
shown in rwo parts for convenience. Part 10a, denoted 'Rx Queues' stores 
received packets whereas part 10b, denoted Tx Queues' stores packets 
which are ready for transmission from respective ports. Figure 18 also 
5 shows media access controllers (MACs) which receive packets from and 

send packets to the physical ports. The MACs 182... 18N on the left are 
each associated with a respective one of the 'front panel' ports 2...N 
whereas tlie MACs 183, 184, 185 and 186 are each associated with a 
respective one of the ^cascade' ports 3 to 6. Each MAC 183-186 receives 
10 data signals (packets) from and sends such signals to the respective port 

and thereby forms a respective MAC domain with the MAC at the other 
end of the link to which the respective port is connected. 

The CPU 9 is shown for convenience separate from the ASIC 7. It is 
15 coupled to the cascade control logic 66 in the manner described with 

reference to Figure 6. 

Figures 19 and 20 illustrate the CCL register fields in normal operation 
(as in Figure 2) and in self-healing operations (Figure 3). 

20 

The Cascade in Normal Operation 

When a packet is received on one of the front ports 2 to N shown on the 
left-hand side of Figure 18, the packet passes through the MAC and is 

25 temporarily stored in the 'Rx Queues' portion of the memory. The 

switching engine 13 will read the layer-2 source and destination address 
from the header of the packet. The look-up engine 12 will (with recourse 
to database 11) determine the destination port of the packet. If the packet 
is destmefi for another front-panel port it will be forwarded to the Tx 

30 Queue associated with that port. If the look-up engine 12 determines that 

the packet is destined for the cascade then it will be forwarded to the Tx 
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Queue of one of the cascade ports. The cascade is a trunk of four ports so 
a hash is iirst computed (based typically on the Layer-2 or Layer-3 source 
and destination addresses) which determines which of the four ports will 
transmit the packet. Packets in the Tx Queues are transmitted on the 
5 ports on a first-in, first-out basis. 

The CPU 9 IS able to determme the status of the stack by reading 
registers v ithin the Cascade Control Logic (CCL). In general it will need to 
do this once soon after power-up. The CPU does not need to periodically 
10 examine the CCL registers since the CCL will interrupt the CPU m the 

event of a change in stack status. 

As may be seen in Figure 19, in normal operation the 
'CURRENT_POWER\ 'CURRENT_CTRL' fields are all T and the 
15 CTRL_OK_UP and CTRL_OK_DOWN fields are each '1', whereas the 

'HEAL' field is '0\ 

Cascade Operation in the Transition from Normal to Healed Mode 

20 Now consider that the power for Unit-2 is removed. The CCL signalling 

mechanism described with reference to Figures 6 to 16 will quickly 
(within approximately 2ms) update the CCL register fields within each 
unit. The change in value of CURRENT_POWER will cause the CCL in 
each unit to interrupt the CPU. The CCL register fields will have the 

25 values shown in Figure 20. 

The CPU in Unit-2 will take no action since it will be powered-off. The 
CPUs in units 0, 1 and 3 will respond to the interrupt by reading the CCL 
registers. Since CURRENT^CTRL = 4^31111 and CURRENT.POWER = 
30 4T31011, the CPUs will conclude that Unit-2 is powered-down and the 




Queue of one of the cascade ports. The cascade is a trunk of four ports so 
a hash is first computed (based typically on the Layer-2 or Layer-3 source 
and destination addresses) which determines which of the four ports will 
transmit the packet. Packets in the Tx Queues are transmitted on the 
5 ports on a first-in, first-out basis. 

The CPU 9 IS able to determine the status of the stack by reading 
registers within the Cascade Control Logic (CCL). In general it will need to 
do this once soon after power-up. The CPU does not need to periodically 
10 examine the CCL registers since the CCL will interrupt the CPU in the 

event of a change in stack status. 

As may be seen in Figure 19, in normal operation the 
'CURRENILPOWER', 'CURRENT_CTRL' fields are all 1' and the 
15 CTRL_OK_UP and CTRL_OK_DOWN fields are each 'l\ whereas the 

HEAL' field is V\ 

Cascade Operation in the Transition from Normal to Healed Mode 

20 Now consider that the power for Unit-2 is removed. The CCL signalling 

mechanism described with reference to Figures 6 to 16 will quickly 
(within approximately 2ms) update the CCL register fields within each 
unit. The change in value of CURRENT_POWER .will cause the CCL in 
each unit to interrupt the CPU. The CCL register fields will have the 

25 values shewn in Figure 20. 

The CPU in Unit-2 will take no action since it will be powered-off. The 
CPUs in units 0, 1 and 3 will respond to the interrupt by reading the CCL 
registers. Since CURRENT^CTRL = 4Td1111 and CURRENT.POWER = 
30 41^1011, the CPUs will conclude that Unit-2 is powered-down and the 
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cascade must be healed. In this example, since HEAL = 0 there are no 
'missing-cable' failures within the cascade. 

The CPU in Unit-0 will note (from the CURRENT.POWER field) that it is 
not adjacent to the powered-down unit and so has sufficient information 
to put its data-path in the 'bypass' mode. The CPU achieves this by 
enabling a special mode within the Switching Engine. Any packets 
received on ports C and D (the right-most ports in Figure 3) of the 
cascade must re-transmitted on those ports without lookup. The CPU 
also adjusts the cascade hashing- algorithm such that cascade packets 
(other than those received on ports C and D of the cascade) are only 
transmitted on ports A and B. The CPU controls the Switching Engine 
modes and cascade hashing-algorithm by writing to control registers 
within the switch ASIC. 

The CPU in Unit-1 will note that it is" adjacent to the powered-down unit 
ajid must heal the cascade by looping-back the data-path. The CPU 
achieves loop-back by adjusting the cascade hashing-algorithm such that 
packets aj e only transmitted on ports C and D (the right-most ones in 
Figure 3). 

The CPU in Unit-3 will note that it is adjacent to the powered-down unit 
and must heal the cascade by looping-back the data-path. The CPU 
adjusts the cascade hashing-algorithm such that packets are only 
transmitted on ports A and B. The CPUs in each of units 0, 1 and 3 will 
also perform the following tasks: 

(a) The CPU will set a switching engine control-register such that the 
Box Bit Mask of transmitted (or re-transmitted) packets does not 
have^ a bit set for Unit-2. Any packets with no BBM bits set will be 
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discarded, as indicated in Figure 5. This will quickly purge the 
cascade of packets destined only for Unit-2. 

.(b) The CPU vvill purge the Forwarding Database of entries relating to 
5 Unit-2, 

The cascade is now in Healed Mode as illustrated by the Healed Ring' in 
Figure 3. 

10 Cacade Operation in Healed Mode 

In this mode, packet forwarding occurs normally, subject to the cascade 
.loop-back and bypass operation that are enabled in the Healed Mode'. 

15 Cascade Operation in the Transition from Healed to Normal Mode 

Consider the stack of switch units in Figure 3. The CCLs in each unit will 
have the register fields shown in Figure 20. Now consider that the power 
for Unit-2 is reconnected and the unit powefs-up. The CCL signalling 

20 mechanism will quickly (within approximately 2ms) update the CCL 

register fields within each unit. The change in value of 
CURRENT _POWER will cause the CCL in each unit to interrupt the CPU. 
The CCL register fields will have the values shown in Figure 19. The 
CPUs in all' units will respond to the interrupt by reading the CCL 

25 registers. The interrupt response time of the CPU in Unit-2 is likely to be 

slower (due a delay caused by the power -up sequence) than that of units 
0, 1 and 3. Since both CURRENT_CTRL and CURRENT_POWER equal 
4Td1111 ail CPUs will conclude that there are four units in the cascade 
and they are all powered up. Since HEAL = 0 the CPUs will also conclude 

30 that there are no "missing-cable" failures within the cascade. The CPU m 

Unit-0 win disable the by-pass mode of the data-path. The CPU will also 
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adjust the casacade hashing-algorithm such that cascade packets are 
transmitted on all four cascade ports (A, B, C and D). 

The CPU in units 1 and 3 change their cascade hashing- algorithm such 
5 that cascade packets are transmitted on all four cascade ports (A, B, C 

and D). This will effectively disable the data-path loop-back. 

The CPU in Unit-2 will set its cascade hashing-algorithm such that 
cascade packets are transmitted on all four cascade ports (A, B, C and 
10 D). The data-path by-pass mode will remain disabled (the default 

configuration) . 

Since Units 0, 1 and 3 respond to the CCL interrupt much more quickly 
than Unit-2 it is possible that these units will heal the data-path (to 
15 include Uiiit.2) before Unit-2 is able to switch packets. This could result 

in the loss of cascade packets for a short period of time. It may be- 
advantageous to delay the healing of the data-path until Unit-2 is fully 
functioning. 

20 The CPUs in all units 0, 1 and 3 also perform the following tasks: 

(a) The CPU must set a switching engine control-register such that the 
Box Bit Mask of transmitted (or re-transmitted) packets allows bits 
to be set for all powered-up units (including Unit-2). 

25 

(b) The CPU must permit the learning of address associated with Unit- 
2. 

The cascade is now in Normal Mode as illustrated by the stack labelled 
'Normal Operation' in Figure 2. 



30 
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Cascade Operation in the Transition from Normal to Healed Mode 
(due to missing cable) 

Consider the stack of switch units in Figure 2 labelled 'Normal 
5 Operation'. The CCLs in each unit will have the register fields shown in 

Figure 19. 

Now consider that the cable between Unit-1 and Unit-2 is removed. 
The CCL signalling mechanism will quickly (within approximately 
10 2ms) update the CCL register fields within each unit. The CCL register 

fields will have the values shown m Figure 21. The change in the value 
of HEAL will cause the CCL in each unit to interrupt the CPU. 

The CPUs in all units will respond to the interrupt by reading the 
15 CCL registers. Since CURRENT„CTRL = 4Td1111 , CURRENT_POWER = 

4T3l 1 1 1 and HEAL = 1 , the CPUs will conclude that the failure is due to a 
missing cable. 

The CPUs in Unit-0 and Unit-3 will note that they are not adjacent to 
20 the missing cable (since their CTRL_OK_UP and CTRL.OK_DpWN 

are both 1). The data paths of units 0 and 3 will be placed in bypass 
mode. The CPU achieves this by enabling a special mode within the 
switching engine. Any packets received on ports C and D (the right-most 
ports in Figure 2] of the cascade must re-transmitted on those ports 
25 . without lookup. The CPU also adjusts the cascade hashing- algorithm 
such that cascade packets (other than those received on ports C and D 
of the cascade) are only transmitted on ports A and B. The CPU controls 
the switching engine modes and cascade hashing- algorithm by writing 
to control registers within the switch ASIC. 



30 
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The CPU in Unit-1 will note that it is adjacent to the missing cable 
(since CTFlL_OK_UP = 0) and it must heal the cascade by looping back 
the data path. The CPU achieves loop-back by adjusting the cascade 
hashing-ajgorithm such that packets are transmitted only on ports C and 
5 D (the right-most ones in Figure 2). 

The CPU in Unit-2 will note that it is adjacent to the missing cable 
(since CTRL_OK_DOWN = 0) and it will heal the cascade by looping 
back the data path. The CPU achieves loop-back by adjusting the 
10 cascade hashing- algorithm such that packets are transmitted only on 

ports A and B. 

The cascade is now in Healed Mode. 

15 



20 




-46- 



CLAIMS 

1. A stack of network units each of which includes a multiplicity of 
5 ports for receiving and forwarding addressed data packets and a 

switching engine for forwarding received packets to at least one port in 
accordance with address data m the packets, and a cascade connection 
comprisini^ for each of two opposite directions around the stack at least 
one unidirectional path for data packets composed of links each between 
10 a respecti\^e cascade port on a unit and a corresponding cascade port on 

the next unit. 

2. A stack of network units according to claim 1 wherein there are at 
least two unidirectional paths for packets for each of the two opposite 

lo directions around the stack, each path including a respective cascade 

port on each unit. 

3. A stack of network units according to claim 1 or 2 wherein in a 
normal mode of operation of each unit the respective switching engine 

20 directs packets received at a cascade port and intended for further 

transmission on the cascade out of the same cascade port and in a loop- 
back mode directs packets received at a cascade port and intended for 
further trcmsmission on the cascade out of a different port and in a 
changed direction of progress around the stack. 

25 

4. A stack of network units according to claim 3 wherein each unit 
includes means for detecting an operational failure between that unit 
and an adjacent unit to cause the unit to enter the loop-back mode. 

30 5. A stack of network units according to claim 3 or 4 wherein each 

unit is operable in a bypass mode wherein packets proceeding on the 
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cascade c*mnection and received at a cascade port are forwarded from 
the same j^ort without being subject to the switching engine. 

6. A stack of network units according to claim 3, 4 or 5 wherein each 
5 switching engine is responsive to control data conveyed between the 

units on ctmtrol paths separate from the said unidirectional paths. 

7. A st ack of network units according to claim 6 wherein said control 
paths axe constituted by a chain of half-duplex links each from one unit 

10. to the next and wherein cascade control logic in each unit responds to 

presence and absence of control frames on respective links to control the 
switching engine of the respective unit. 

8. A stack of units according to any foregoing claim wherein each 
15 packet which is received at any given port from an external network and 

is forwarded onto the cascade connection includes a header source field 
which uniquely across the stack identifies the given port and the 
respective unit. 

20 9. A stack of units according to claim 8 wherein the header source 

field is a multiple bit binary field of which a first plurality of bits 
identifying the given port within its unit and a second plurality of bits 
identify the unit. 

25 10. A stack of units according to any foregoing claim wherein each 

packet which is forwarded onto the cascade connection includes a header 
destination field for identifying a destination unit and a port thereon and 
another field indicating the validity of the header destination field. 




11. A stack of units according to claim 8 or 9 and claim 10 wherein the 
header source field and the header destination field have the same 

format. 

5 12. A stack of units according to any foregoing claim wherein each 

packet which is forwarded onto the cascade includes a header portion . 
which indicates which of the units have been traversed by the packet. 

13. A stack of units according to claim 12 wherein the header portion 
10 is a bit mask. 

14. A network unit which is capable of use in a cascade stack of 
network units and including a multiplicity of ports for receiving and 
forwarding addressed data packets and a switching engine for forwarding 

15 received packets to at least one port in accordance with address data in 

the packets, the unit including at least two cascade ports for connection 
to other units in the stack, said unit having under control of the 
switching engine a normal mode wherein packets received at any of the 
cascade ports are forwarded from the same port in the same direction of 

20 progress and a loop-back mode wherein a packet received at a cascade 

port is forwarded from a different cascade port in a different direction of 
progress. 

15. A network unit according to claim 14 wherein the switching engine 
25 is responsive to control data indicating the operational status of other 

units in the stack to determine the mode. 

16. A network unit according to claim 14 or 15 wherein the unit has a 
bypass mode in which packets received at a cascade port and intended 

30 for further transmission on said cascade are forwarded from the same 

port without being subject to the switching engine. 
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17. A n<?twork unit which is capable of use in a cascaded stack of 
network units and includes a multiplicity of ports for receiving and 
forwarding addressed data packets, a switching engine for directing 

5 received packets to at least one port m accordance with address data in 

the packets and at least two cascade ports for receiving packets from and 
sending packets to adjacent units in the cascaded stack, the unit 
including means for detecting an operational failure between this unit 
and an adjacent unit in the stack and for controlling the switching engine 
10 to redirect packets which would otherwise be sent from a particular port 

to that adjacent unit to be forwarded from another port whereby to be 
sent to a different unit in the stack. 

18. A network unit according to claim 17 wherein the unit is 
15 responsive to control data indicating an operational failure between two 

other units in a stack to enter a bypass mode to cause packets received 
at a cascade port and intended for further transmission on the cascade to 
be forwarded without being re-directed by the switching engine. 

20 19. A network unit according to claim 17 or 18 wherein the unit has at 

least one cascade port for reception and forwarding of packets in a first 
direction around the cascade and at least one cascade port for reception 
and forwarding of packets in a second direction around the cascade. 

25 20. A network unit according to claim 19 wherein the unit has at least 

two cascade ports for each of the first and second directions. 

21. A network unit according to any of claims 14 to 20 wherein the 
unit includes control logic for forwarding control frames to and receiving 
30 control frames from one each of two control paths and for thereby 
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determining the operational status of other units so as to control said 
switching engine. 

22. A n<-twork unit according to claim 17 and claim 21 wherein the 
5 control logic detects said operational failure. 

23. A network unit according to any of claims 14 to 22 and arranged to 
provide foz- each packet that is forwarded from a cascade port a header 
which includes a destination port field that identifies a destination port 

10 and the unit on which the port is located 

24. A network unit according to claim 23 wherein the header includes 
a field that indicates the validity of the destination port field. 

15 25. A network unit according to claim 23 or 24 wherein the said header 

includes a source port field which identifies a port by which the packet 
has been received and the unit on which the source port is located. 

26. A ne^twork unit according to any of claims 14 to 25 wherein the 
20 unit provides for each packet that it forwards from a cascade port a 

header portion which identifies which units in a stack have and have not 
been traversed by the packet. 

27. A network unit according to claim 26 wherein said header portion 
25 is a bit mask. 

28. A network unit according to claim 26 or claim 27 wherein the unit 
IS arranged in response to the said header portion to discard the packet if 
the header portion indicates that the packet has already traversed the 

30 unit. 
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29. A network unit according to any of claims 26 to 28 wherein the 
unit is arranged in response to the header portion and either to an 
indication that a destination port for the packet is known to determine 
whether tlie destination port is on the unit or to an indication that a 

5 destination port is unknown to perform a look-up in an address database 

for the destination port. 

30. A ne-twork unit capable of use within a cascaded stack of network 
units, and including a multiplicity of ports for receiving addressed data 

10 packets from and sending packets to an external network, a look-up 

database for relating address data in packets to forwarding data which 
includes an identification of at least one port and a switching engine 
which responds to forwarding data to direct packets to at least one 
corresponding port, and at least one cascade port for transmission of 

15 data packets between the unit and other units in the cascade stack, 

wherein the unit provides for each packet that it receives at one of the 
said multiplicity of ports and forwards from a cascade port a header field 
that indicates whether a destination port in the stack is known for the 
packet and a header field identifying the destination port arid the unit on 

20 which the destination port is located. 

31. A network unit according to claim 30 wherein the header includes 
a source port field that identifies a port at which the packet has been 
received and the unit containing that port. 

32. A network unit according to claim 30 or 31 wherein each packet 
forwarded from the cascade port includes a field which has portions for 
denoting which of the units of the stack have been and have not been 

traversed by the packet. • 
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33. An addressed data packet for use within a cascaded stack of 
network units which each have a multiplicity of ports for receiving and 
forwarding data packets, the addressed data packets including a header 
which includes a first field and a second field which are in a common 
format and uniquely identify a source port and unit on which the packet 
has been l eceived and a destination port and unit from which the packet 
is to be forwarded. 

34. An addressed data packet according to claim 33 wherein each of 
the first and second fields is a multiple-bit word of which a first plurality 
of bits denote a port and a second plurality of bits denote a unit. 

35. An addressed data packet according to claim 33 or 34 and 
includmg a third field for indicating whether the destination port and 
unit are known. 

36. An addressed data packet according to any of claims 33 to 35 and 
including portions which conform the packet to an Ethernet transmission 
protocol. 

37. An addressed data packet for use within a cascaded stack of 
network units which each have a multiplicity of ports for receiving and 
forwarding data packets, the addressed data packets including a header 
including a field for indicating whether the destination port and unit are 
known and a field for identifying the destination port and unit from 
which the packet is to be forwarded. 

38. An addressed data packet according to claim 33 wherein each of 
the identifying field is a multiple-bit word of which a first plurality of bits 
denote a port and a second plurality of bits denote a unit. 
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39. An addressed data packet according to any of claims 37 to 38 and 
including portions which conform the packet to an Ethernet transmission 
protocol. 

5 40. A network unit capable of use within a cascaded stack of network 

units, and including a multiplicity of ports for receiving addressed data 
packets from and sendmg packets to an external network, a look-up 
database for relating address data in packets to forwarding data which 
includes an identification of at least one port and a switching engine 

10 which responds to forwarding data to direct packets to at least one 

corresponding port, and at least one cascade port for transmission of 
data packets between the unit and other units in the cascade stack, 
wherein the unit provides for each packet that it forwards from a cascade 
port a header field which has portions for denoting which of the units of 

15 the stack liave been and have not been traversed by the packet. 

41. A network unit according to claim 32 or claim 40 and including 
means foj- discarding the packet when the respective header field 
indicates that the packet has already traversed the unit. 

20 

42. A network unit according to claim 41 wherein the unit discards the 
packet if the respective header field indicates that the packet has 
traversed all the units in the stack. 



25 
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ABSTRACT 

Network units such as a switch for use in a cascaded stack are 
organised to provide a cascade connection in the form of a dual 
unidirectional connection so that, in its ordinary configuration, there is 
at least one and preferably more than one unidirectional ring for each 
direction ?iround the cascade, each ring including a respective port on 
each unit. For each ring, each port on a unit is connected by a respective 
link to a corresponding port on the preceding unit and the following unit. 
The units provide a self-healing operation in the event of various kinds of 
operational failure. The self-healing operation includes loop-back of 
packets in units adjacent the failure and bypass of a packet switching 
process for other units. The units include control logic for passing control 
frames containing status information relating to the units and links 
between them and for co-operation with a CPU to control a switching 
engine to perform the self-healing operation in accordance with that 
status information. The units forward on the cascade packets with 
headers that identify a destination port and the unit on which that port is 
located and also indicate which units have and have not been traversed 
by a packet. 
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always @(posedge elk or negedge reset) 
begin 

if (reset) begin 

iTi7_regs__UNIT_ID <= 0; 
irY_regsjy[ISCDHFIG_PEQ <= FALSE; 
my_regsJ^EI^_REQ <= FALSE; 
end 

else begin . . 

nTy_^regs__UNIT_ID <= ny^regsJUNITJCD; 
rnyjregsJffiSCX)lSIFIG_REQ <= FALSE; 
my_regsJ?ENUM_RBQ <= FALSE; 

if (PRIMARY) begin 
if (RESILENT) 

nT7_j:egsJUNIT_ID <= 0; 

end 

else begin 
if ( ldawn_regs_IDLE) begin 

if (down_regsJJNIT_II>=3 && down^regsJSESCTjClR^^ 

iTY_regs_MISCONFIG_REQ <= TRUE; 
else if (dcv;n_j:egsJUNIT_ID =3) 

itYjregsJ^EISIUM_RBQ <= TBUE; 
else 

iTTy_regsJUNIT_ID <= doMn_reqs_UNIT ID + 1; 
end 
end 
end 
end 

always 

begin // generate UNIT__ID fron NEXT^CTRL 

casex (dc^_regs_NEXT_CrRL) 

4'blxxx: daA7n__regs__UNIT_ID = 3; 
4'bOlxx: down_regs_UNIT_ID = 2; 
4'bOOlx: down_regs_UNIT_ID = 1; 
default: dcwn_regsJLJNIT_ID = 0; 
endcase 
end 
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always @(pc3edge elk or negedge reset) 

begin ' , 

if (reset.) begin 

iTTy_regsjClJRPEOT_PO^ <= 4'bOOOO; ' , , 

iny^regs^NEXT^PCMER <= 4'bOOOO; 
^itf^::^i^J:egsj:xjFP^ ' bOOOO ; 

Mini^;;jE:eg3^^ ' <^ 4 'bOOOO; 

end 

else begin 

if ( !down_regs_IDLE) begin 
if (PRIMARY) begin 

iTr/_regsjGJRREOT_POWE^ <= dc^_regs_NE2<T_Pa^; 
iTr/_regsJ^lEDa;_POW^ <= MY_POWER « iTy_regsJUNIT__ID; 
// MYJPOWER is a sanple of the input Unit_PaK?er_Sense 
iTr/_regsjCIJRRENTj:jI^ <= damjoegsJ^E^JJTPlj; 
myjcegsJ>^EXTJJT9Ju <= I'bl « m7_regs_UNITJLD; 

end 

else begin 

n^_regsj3inRREIsrr_P0WER <= dcwn_regs_CURRENT PCWER; 
my_regsJ^5EXT_PCWER <= down_regsJSEOT_PCXNlE^ (MYJPOtSER « 
nry^regs JUNIT_ID ) ; 

iTy_regsJXIRRElSlT_CTRL <= down_regs_aJRREnOT CTRL; 
iny_regsJ^IEXTjC::T^ <= down_regsJsIE3a'jCT^ J (I'bl « 
rny__regsJLJNIT_ID) ; 
end 
end 

else b2gin 

// leave registers unchanged 
end 
end 
end 
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